Integrative random forest for gene regulatory network inference

نویسندگان

  • Francesca Petralia
  • Pei Wang
  • Jialiang Yang
  • Zhidong Tu
چکیده

MOTIVATION Gene regulatory network (GRN) inference based on genomic data is one of the most actively pursued computational biological problems. Because different types of biological data usually provide complementary information regarding the underlying GRN, a model that integrates big data of diverse types is expected to increase both the power and accuracy of GRN inference. Towards this goal, we propose a novel algorithm named iRafNet: integrative random forest for gene regulatory network inference. RESULTS iRafNet is a flexible, unified integrative framework that allows information from heterogeneous data, such as protein-protein interactions, transcription factor (TF)-DNA-binding, gene knock-down, to be jointly considered for GRN inference. Using test data from the DREAM4 and DREAM5 challenges, we demonstrate that iRafNet outperforms the original random forest based network inference algorithm (GENIE3), and is highly comparable to the community learning approach. We apply iRafNet to construct GRN in Saccharomyces cerevisiae and demonstrate that it improves the performance in predicting TF-target gene regulations and provides additional functional insights to the predicted gene regulations. AVAILABILITY AND IMPLEMENTATION The R code of iRafNet implementation and a tutorial are available at: http://research.mssm.edu/tulab/software/irafnet.html

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance and Improvement of Tree-Based Methods for Gene Regulatory Network Reconstruction

Computational reconstruction of gene regulatory networks (GRNs) from gene expression data is of great importance in systems biology. Dialogue for Reverse Engineering Assessments and Methods (DREAM) challenge aims to evaluate the success of computational GRN inference algorithm on benchmarks of simulated data. Tree-based methods, such as Random Forest, infer true regulators of a target gene in a...

متن کامل

A Learning Framework to Improve Unsupervised Gene Network Inference

Network inference through link prediction is an important data mining problem that finds many applications in computational social science and biomedicine. For example, by predicting links, i.e., regulatory relationships, between genes to infer gene regulatory networks (GRNs), computational biologists gain a better understanding of the functional elements and regulatory circuits in cells. Unsup...

متن کامل

Predictive regulatory models in Drosophila melanogaster by integrative inference of transcriptional networks.

Gaining insights on gene regulation from large-scale functional data sets is a grand challenge in systems biology. In this article, we develop and apply methods for transcriptional regulatory network inference from diverse functional genomics data sets and demonstrate their value for gene function and gene expression prediction. We formulate the network inference problem in a machine-learning f...

متن کامل

Supplementary Information Predictive regulatory models in Drosophila melanogaster by integrative inference of transcriptional networks

S1 Fraction of integrative network edges with and without physical support . . . . . 4 S2 Degree distributions of integrative networks . . . . . . . . . . . . . . . . . . . . . 5 S3 Structural properties of integrative networks . . . . . . . . . . . . . . . . . . . . . 6 S4 Network motifs of integrative networks . . . . . . . . . . . . . . . . . . . . . . . . 7 S5 Prediction of novel functional...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 31  شماره 

صفحات  -

تاریخ انتشار 2015